Categorical linear models
ثبت نشده
چکیده
Suppose that we have a continuous random variable Y and a categorical variable C with levels c 1 • Are all of these random variable means the same? In other words, is it true that: µ 1 (Y) = µ 2 (Y) = · · · = µ (Y)? • For which pairs of levels c i , c j are the associated random variable means µ i (Y), µ j (Y) equal? • For each pair of levels c i , c j , what is the difference µ j (Y) − µ i (Y) between the associated random variable means? Examining the data Fitting the model Sampling Variability Assumptions normality, equal variances, independence what if variances aren't equal? if variances depend on mean monoton-ically and data are positive, try log or square root transformations. otherwise , if sample sizes are about equal, know that the result will be inflation of the p value. EXAMPLE: HEIGHTS OF SINGERS BY SECTION. Choir singers are often divided into four sections by voice range. From highest voice range to lowest, these sections are: soprano, alto, tenor, and bass. Are the average heights among these groups different? If so, by how much? (Intuition would suggest some differences, even if only related to the fact that sopranos and altos are usually women, while tenors and basses are usually men.) From the statement of the problem, we determine three non-statistical formulate the questions of interest questions of interest: 1. Are the average heights of all of the sections equal? 2. If not (or for additional information), for which pairs of sections do the average heights differ? 3. Also if not (or for additional information), by how much do the average heights differ within each pair of sections? We translate these into corresponding statistical questions of interest. We state these questions in terms of the variables of interest: section, a categorical variable with levels S (soprano), A (alto), T (tenor), and B (bass); and height, a random variable that gives the height of a randomly chosen chorister from a suitable population of choirs. We also use the notation that sectionS is an indicator variable for the condition that section = S, and similarly for sectionA, sectionT, and sectionB. Also, we use β A , β T , and β B to denote the coefficients of sectionA, sectionT, and sectionB in the (unknown) true linear model of …
منابع مشابه
Determinants of Inflation in Selected Countries
This paper focuses on developing models to study influential factors on the inflation rate for a panel of available countries in the World Bank data base during 2008-2012. For this purpose, Random effect log-linear and Ordinal logistic models are used for the analysis of continuous and categorical inflation rate variables. As the original inflation rate response to variables shows an appar...
متن کاملRelating Categorical Semantics for Intuitionistic Linear Logic
There are several kinds of linear typed calculus in the literature, some with their associated notion of categorical model. Our aim in this paper is to systematise the relationship between three of these linear typed calculi and their models. We point out that mere soundness and completeness of a linear typed calculus with respect to a class of categorical models are not sufficient to identify ...
متن کاملA comparison of statistical models for short categorical or ordinal time series with applications in ecology
We study two statistical models for short-length categorical (or ordinal) time series. The first one is a regression model based on generalized linear model. The second one is a parametrized Markovian model, particularizing the discrete autoregressive model to the case of categorical data. These models are used to analyze two data-sets: annual larch cone production and weekly planktonic abundance.
متن کاملCategorical Models of Explicit Substitutions
This paper concerns itself with the categorical semantics of -calculi extended with explicit substitutions. For the simply-typed -calculus, indexed categories seem to provide the right categorical framework but because these structures are inherently non-linear, alternate models are needed for linear -calculi extended with explicit substitutions. We propose to replace indexed categories by pres...
متن کاملRecursive Path Models when Both Predictor and Response Variables are Categorical
Recursive path analysis is a useful tool for inference on a sequence of three or more response variables in which the causal effects of variables, if any, are in one direction. The primary objective in such analysis is to decompose the total effect of each variable into its direct and indirect components. Methods for recursive analysis of a chain of continuous variables are well developed but t...
متن کاملCategorical Data Analysis
This course introduces principles and analyses related to data with categorical outcomes. This course will consider topics such as probability distributions with categorical data, contingency table analysis, the generalized linear model, logit models and loglinear models. Students are expected to: a) learn to select methods appropriate for a question of interest for data with a categorical outc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010